add Qwen2-VL static generation #1512

Spycsh · 2024-11-22T06:58:34Z

What does this PR do?

Add Qwen2-VL static generation.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

Spycsh · 2024-11-22T07:03:03Z

The pipeline test will not pass until optimum-habana matches the latest changes in transformers huggingface/transformers#34769, namely the task name image-to-text ==> image-text-to-text in examples/image-to-text/run_pipeline.py for many of the VLMs. I have currently validated the pass using my own test script https://github.com/Spycsh/qwen-vl-hpu/blob/main/qwen2_vl.py.

jiminha · 2024-11-25T18:01:32Z

age-to-text ==> image-text-to-text in examples/image-to-text/run_pipeline.py for many of the VLMs. I have currently validated the pass using my own test script https://github.com/Spycsh/qwen-vl-hpu/blob/main/qwen2_vl.py.

Are you saying we need transformer4.47 for this to work or can you update the examples/images-to-text/run_pipeline to support both cases?

optimum/habana/transformers/generation/utils.py

jiminha · 2024-11-25T23:29:51Z

@tthakkal Could you also review this please. THanks.

Spycsh · 2024-11-27T03:15:41Z

age-to-text ==> image-text-to-text in examples/image-to-text/run_pipeline.py for many of the VLMs. I have currently validated the pass using my own test script https://github.com/Spycsh/qwen-vl-hpu/blob/main/qwen2_vl.py.

Are you saying we need transformer4.47 for this to work or can you update the examples/images-to-text/run_pipeline to support both cases?

Yes. An update to latest transformers is needed here. run_pipeline.py also need to be updated correspondingly. I will look into this and get back to you later.

…nto qwen2_vl

Spycsh · 2024-12-09T03:42:45Z

Hello, with some small fixes, now the example can be run with following command under transformers 4.45.2 (current optimum-habana compatible version)

python3 run_pipeline.py     --model_name_or_path Qwen/Qwen2-VL-2B-Instruct     --bf16

Kindly review this at your convenience. Thank you! :)

vidyasiv

please add/update relevant test(s) for new model: https://github.com/huggingface/optimum-habana/blob/main/tests/test_image_to_text_example.py

Spycsh · 2024-12-11T05:49:10Z

Hi @vidyasiv @jiminha , I have added the tests and README, tested the Qwen2-VL-7b and Qwen2-VL-2b with GAUDI2_CI=1 RUN_SLOW=1 python -m pytest test_image_to_text_example.py -v -s -k Qwen/Qwen2-VL-2B-Instruct , and also fixed the issue when using HPU graph and made HPU graph enabled as default. Now the perf with warmups should be good enough.

Please review at your convenience. Thank you! :)

vidyasiv

thanks @Spycsh for adding tests

vidyasiv

lgtm (only reviewed from test perspective), approving to remove request changes

Spycsh and others added 20 commits November 4, 2024 06:37

draft qwen2-vl on hpu

b0768e3

fix name

0317aa1

prepare

aaac6bc

baseline prepare

55b317f

prefill pass

47e8763

decode pass, fix perf issue

4587610

remove useless sync with hpu graph

05b429c

remove debug info

e65c1d6

add comment

c5d44f5

fix

3081a66

ruff

2ec286f

add test

e9b69b9

fix img path

ae7c395

fix img path

0e848c6

merge

a72b930

fix

d8ed9f0

revert assistant

bd73757

Merge remote-tracking branch 'source/main' into qwen2_vl

1bd5895

add default

5e9dda1

Merge branch 'huggingface:main' into qwen2_vl

5106dc3

Spycsh requested review from ssarkar2, bhargaveede, vivekgoe and regisss as code owners November 22, 2024 06:58

jiminha reviewed Nov 25, 2024

View reviewed changes

optimum/habana/transformers/generation/utils.py Outdated Show resolved Hide resolved

jiminha requested changes Nov 25, 2024

View reviewed changes

optimum/habana/transformers/generation/utils.py Outdated Show resolved Hide resolved

Spycsh added 7 commits November 26, 2024 22:26

merge

7ab207e

explicit synchronize without hpu graph

10baae2

Merge branch 'qwen2_vl' of https://github.com/Spycsh/optimum-habana i…

c04b90a

…nto qwen2_vl

merge

5533d8c

overwrite preprocess

d6d7e47

cleanup

21eee26

remove hpu graph

c41e2b8

vidyasiv suggested changes Dec 10, 2024

View reviewed changes

Spycsh added 2 commits December 11, 2024 03:49

Merge remote-tracking branch 'source/main' into qwen2_vl

c80bbf6

fix hpu graph issue and add test

f7cfe0b

clear blank line

b83e95c

vidyasiv reviewed Dec 11, 2024

View reviewed changes

vidyasiv approved these changes Dec 11, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add Qwen2-VL static generation #1512

add Qwen2-VL static generation #1512

Spycsh commented Nov 22, 2024

Spycsh commented Nov 22, 2024

jiminha commented Nov 25, 2024

jiminha commented Nov 25, 2024

Spycsh commented Nov 27, 2024 •

edited

Loading

Spycsh commented Dec 9, 2024

vidyasiv left a comment

Spycsh commented Dec 11, 2024 •

edited

Loading

vidyasiv left a comment

vidyasiv left a comment

add Qwen2-VL static generation #1512

Are you sure you want to change the base?

add Qwen2-VL static generation #1512

Conversation

Spycsh commented Nov 22, 2024

What does this PR do?

Before submitting

Spycsh commented Nov 22, 2024

jiminha commented Nov 25, 2024

jiminha commented Nov 25, 2024

Spycsh commented Nov 27, 2024 • edited Loading

Spycsh commented Dec 9, 2024

vidyasiv left a comment

Choose a reason for hiding this comment

Spycsh commented Dec 11, 2024 • edited Loading

vidyasiv left a comment

Choose a reason for hiding this comment

vidyasiv left a comment

Choose a reason for hiding this comment

Spycsh commented Nov 27, 2024 •

edited

Loading

Spycsh commented Dec 11, 2024 •

edited

Loading